Towards the automatic processing of Yongning Na (sino-tibetan): developing a 'light' acoustic model of the target language and testing 'heavyweight' models from five national languages
نویسندگان
چکیده
Automatic speech processing technologies hold great potential to facilitate the urgent task of documenting the world’s languages. The present research aims to explore the application of speech recognition tools to a littledocumented language, with a view to facilitating processes of annotation, transcription and linguistic analysis. The target language is Yongning Na (a.k.a. Mosuo), an unwritten Sino-Tibetan language with less than 50,000 speakers. An acoustic model of Na was built using CMU Sphinx. In addition to this ‘light’ model, trained on a small data set (only 4 hours of speech from 1 speaker), ‘heavyweight’ models from five national languages (English, French, Chinese, Vietnamese and Khmer) were also applied to the same data. Preliminary results are reported, and perspectives for the long road ahead are outlined.
منابع مشابه
مدل دو مرحله ای شکاف- گلچین برای نمایه سازی خودکار متون فارسی
Purpose: Each language has its own problems. This leads to consider appropriate models for automatic indexing of every language. These models should concern the exhaustificity and specificity of indexing. This paper aims at introduction and evaluation of a model which is suited for Persian automatic indexing. This model suggests to break the text into the particles of candidate terms and to c...
متن کاملمقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملAttitudes towards English as an International Language (EIL) in Iran: Development and Validation of a New Model and Questionnaire
This study aimed at developing and validating a new model and instrument to explore attitudes of Iranian EFL learners towards English as an International Language (EIL). In so doing, the researchers followed several rigorous steps including extensive literature review, content selection, item generation, designing the rating scales and personal information part, Delphi technique, item revision,...
متن کاملThe tone patterns of numeral-plus-classifier phrases in Yongning Na: a synchronic description and analysis
Level-tone systems in the Sino-Tibetan family are now well-attested, and increasingly well-described (about Pumi, see Ding 2006 and Jacques 2011; about Hakha Lai: Hyman and VanBik 2002, 2004; see also the synthesis by Evans 2008). The present study aims to contribute to this strand of research by describing a specific area of the tonal morphology of the Yongning Na language, namely the tone pat...
متن کاملThe Impact of Learning Styles on the Iranian EFL Learners' Input Processing
This research study explored the impact of learning styles and input modalities on the second language (L2) learners' input processing (IP). This study also sought to appraise the usefulness of Processing Instruction (PI) and its components in relation to the learners' learning styles and input modalities. To this end, 73 male and female Iranian EFL learners from Islamic Azad University, North ...
متن کامل